L Number 


Hits 


Search Text 


DB 


Time stamp 


1 


36 


probability with answer with selected 


USPAT; 

US-PGPUB; 
EPO- JPO- 
IBM TDB 


2003/10/31 08:25 


2 


33 


(probability with answer with selected) and (@rlad<=20010131 or @ad<=20010131)


USPAT; 

US-PGPUB; 
EPO- JPO- 
IBM TDB 


2003/10/31 09:26 


3 


4 


(ineffective adj question) and (@rlad<=20010131 or @ad<=20010131) 


USPAT; 

US-PGPUB; 
EPO* JPO- 
IBM TDB 


2003/10/31 09:45 


4 


0 


(ineffective adj survey adj question) and (@rlad<=20010131 or @ad<=20010131)


USPAT; 


2003/10/31 09: 30 






US-PGPUB; 
EPO- JPO- 
IBM TDB 




5 


0 


(ineffective with (survey adj question)) and (@rlad<=20010131 or @ad<=20010131)


USPAT, 
US-PGPUB; 
EPO- JPO- 
IBM TDB 


2003/10/31 09: 30 


6 


1 


(ineffective same (survey adj question)) and (@rlad<=20010131 or @ad<=20010131)


USPAT; 

US-PGPUB; 
EPO- JPO- 
IBM TDB 


2003/10/31 09: 39 


7 


1 


(ineffective$5 same (survey adj question)) and (@rlad<=20010131 or @ad<=20010131)


USPAT; 

US-PGPUB; 
EPO* JPO- 
IBM TDB 


2003/10/31 09:40 


8 


61 


(ineffective$5 with question) and (@rlad<=20010131 or @ad<=20010131)


USPAT; 

US-PGPUB; 
EPO; JPO;

IBM TDB 


2003/10/31 09:40 


9 


3 


((ineffective$5 with question) and (@rlad<=20010131 or @ad<=20010131)) and survey


USPAT; 

US-PGPUB; 
EPO; JPO;

IBM TDB 


2003/10/31 09: 43 


10 


0 


4627818.pn 


USPAT; 

US-PGPUB;
EPO; JPO;

IBM TDB 


2003/10/31 09: 43 


11 


1 


4627818.pn. 


USPAT; 

US-PGPUB;
EPO; JPO;
IBM TDB 


2003/10/31 09:43 


12 


16 


(honest$2 adj (response answer)) and (@rlad<=20010131 or @ad<=20010131)


USPAT; 

US-PGPUB;

EPO; JPO;

IBM_TDB


2003/10/31 09:49 


13 


51 


(survey adj respondent) and (@rlad<=20010131 or @ad<=20010131) 


USPAT; 


2003/10/31 09:49 






EPO; JPO, 

IBM TDB 




14 


15 


("5237678" | "5311173" | "5465321" | "5608840" | "5638489" | "5704017" | "5884311" |
"5930803" | "6012058" | "6021383" | "6032146" | "6049797" | "6067572" | "6115708" |
"6144838").PN.


USPAT 


2003/10/31 10:41 


19 


1 


6374251.pn. 


USPAT; 

US-PGPUB;
EPO; JPO;

IBM_TDB


2003/10/31 12:05 


20 


10 


(US-6633882-$ or US-5832108-$ or US-6616458-$ or US-6460035-$).did. or
(US-20030088458-$ or US-20010014868-$ or US-20010027455-$ or
US-20020188689-$ or US-20020046199-$ or US-20020019747-$).did.


USPAT;

US-PGPUB 


l\J\j3/1\j/dL il'.kjO 


21 


7 


((US-6633882-$ or US-5832108-$ or US-6616458-$ or US-6460035-$).did. or
(US-20030088458-$ or US-20010014868-$ or US-20010027455-$ or 
US-20020188689-$ or US-20020046199-$ or US-20020019747-$).did.) and percentSS 


USPAT; 

US-PGPUB


2003/10/31 12:06 


- 


6 


best adj fit adj cluster$3


USPAT; 

US-PGPUB; 

EPO, JPO; 

IBM TDB 


2003/10/29 15:54 




12027 


champion adj chalenge clustering 


USPAT; 

US-PGPUB; 

EPO, JPO; 

IBM TDB 


2003/10/08 15: 34 
11


champion adj challenger 


USPAT; 


2003/10/09 12:57 






US-PGPUB; 
EPO; JPO; 

IBM_TDB






9 


champion$1 challenger


USPAT;
US-PGPUB; 

EPO; JPO; 

IBM TDB 


2003/10/09 13:01 


- 


669 


marketing adj strategy 


USPAT; 


2003/10/09 13:02 






US-PGPUB; 

EPO; JPO; 

IBM TDB 




- 


0 


champion$1challenger and (marketing adj strategy)


USPAT; 

US-PGPUB; 
EPO* JPO- 
IBM TDB 


2003/10/09 13:02 




53 


(marketing adj strategy) and cluster$3


USPAT; 

US-PGPUB; 

EPO; JPO; 

IBM_TDB


2003/10/09 13:02 


- 


0 


((marketing adj strategy) and cluster$3) and bestfit


USPAT; 


2003/10/09 13:54 






US-PGPUB; 
EPO- JPO- 
IBM TDB 




- 


1 


((marketing adj strategy) and cluster$3) and best-fit 


USPAT; 


2003/10/09 13:02 






US-PGPUB; 
EPO* JPO- 
IBM TDB 






3 


((marketing adj strategy) and cluster$3) and best adj fit


USPAT; 

US-PGPUB; 

EPO; JPO; 

IBM_TDB


2003/10/09 13:48 


- 


0 


6567786.URPN.


USPAT 


2003/10/09 13:05 




19 


("4908761" | "5227874" | "5278751" | "5717923" | "5740549" | "5758257" | "5774868" |
"5819241" | "5819285" | "5832457" | "5914670" | "5930764" | "5933811" | "5937387" |
"5940809" | "5946661" | "5970464" | "6267672" | "6286005").PN.


USPAT 


2003/10/09 13:06 


- 




5,659,626 


USPAT; 

US-PGPUB; 
EPO- JPO- 
IBM TDB 


2003/10/09 13:49 


- 


1 


5,659,626.pn. 


USPAT; 

US-PGPUB; 

EPO; JPO; 

IBM_TDB


2003/10/09 13:49 


- 


16 


(bestfit or (best adj fit) or best-fit) adj (cluster$3 or segment$6)


USPAT; 

US-PGPUB; 

EPO; JPO; 

IBM TDB 


2003/10/09 14: 35 




0 


6438579.URPN. 


USPAT 


2003/10/09 13:57 




12 


("5237157" | "5678041" | "5724567" | "5918213" | "5963916" | "5991799" | "6041311" |
"6049777" | "6092049" | "6112186" | "6330592" | "6334127").PN.


USPAT 


2003/10/09 13:57 
C:\APPS\EAST\Workspaces\09867803.wsp



USPAT; 

US-PGPUB; 

EPO; JPO; 

IBM TDB 



100 



Search History 10/31/03 (time illegible) PM, Page 3

5774662

5850350



5202955
5216749
6246880
5761472
5991340
6159766
5623425
5835774
6062190
6351569
5625374
6018575
6085173
6189792
4304204
4967370
6047287
6068660
6205009
6262085
6262086
6263355
6268385
6312712
6312723
6316020
6326372
6326384
6369087
6444689
5187773
5522014
5537644
6373408
5485531
5494624
5528524
5606248
5677844
5809304
5373220
5625823
6144984
4774683
4805230
4862399
4870594
4951189
4972310
5235430
5243514
5297307
5339128
5345159
5348705
5362648
5375061
5394337
5400415
5418917
5424248
5442416
5545963
5588008
5615822
5623505
5623651
5629877
5638291
5684810
5692153
5702625
5704033
5708342
5731869
5737228
5751365



).pn. 



2003/10/09 14:46 




- 


2715 


bestfit




USPAT;
US-PGPUB;
EPO; JPO;
IBM TDB 


2003/10/09 14:46 


- 


424 


bestfit and clustering 




USPAT;


2003/10/09 14:47 








US-PGPUB; 

EPO; JPO; 

IBM_TDB






2 


(bestfit and clustering) and marketing




USPAT; 


2003/10/09 14:46 








US-PGPUB; 

EPO; JPO; 

IBM TDB 




- 


44 


(bestfit and clustering) and survey 




USPAT; 


2003/10/09 14:47 








US-PGPUB; 

EPO; JPO; 

IBM TDB 




- 


1025 


707/6.ccls.




USPAT; 

US-PGPUB; 
EPO- JPO- 
IBM TDB 


2003/10/09 16: 20 


- 


3258 


707/10.ccls.




USPAT; 

US-PGPUB; 

EPO; JPO; 

IBM TDB 


2003/10/09 16:20 


- 


2605 


707/104.1.ccls.




USPAT; 

US-PGPUB; 

EPO; JPO; 

IBM_TDB


2003/10/09 16:20 


- 


1 


(707/6.ccls. 707/10.ccls. 707/104.1.ccls.) and bestfit




USPAT; 

US-PGPUB; 
EPO- JPO- 
IBM TDB 


2003/10/09 16:20 


- 


366 


(707/6.ccls. 707/10.ccls. 707/104.1.ccls.) and clustering




USPAT; 


2003/10/09 16:20 








US-PGPUB; 
EPO* JPO- 
IBM TDB 




- 


30 


((707/6.ccls. 707/10.ccls. 707/104.1.ccls.) and clustering) and survey




USPAT; 


2003/10/09 16: 21 








US-PGPUB; 

EPO; JPO; 

IBM_TDB




- 


1918 


705/14.ccls. 




USPAT; 

US-PGPUB; 
EPO- JPO* 
IBM TDB 


2003/10/09 16:21 


- 


1086 


705/10.ccls.




USPAT; 

US-PGPUB; 
EPO- JPO- 
IBM TDB 


2003/10/14 08:24 


- 


1 


(705/14.ccls. 705/10.ccls.) and bestfit




USPAT; 

US-PGPUB; 

EPO; JPO; 

IBM_TDB


2003/10/09 16:21 


- 


79 


(705/14.ccls. 705/10.ccls.) and clustering




USPAT; 


2003/10/09 16:21 








US-PGPUB;
EPO; JPO;

IBM TDB 




- 


0 


706.45.ccls. 




USPAT; 

US-PGPUB; 
EPO- JPO- 
IBM TDB 


2003/10/09 16:21 


- 


598 


706/45.ccls.




USPAT; 

US-PGPUB; 
EPO- JPO- 
IBM_TDB


2003/10/09 16:21 


- 


109 


706/48.ccls.




USPAT; 

US-PGPUB; 
EPO- JPO- 
IBM TDB 


2003/10/09 16:21 


- 


387 


706/52.ccls.




USPAT; 

US-PGPUB; 

EPO; JPO; 

IBM TDB 


2003/10/09 16:22 




1 


(706/45.ccls. 706/48.ccls. 706/52.ccls.) and bestfit




USPAT; 

US-PGPUB, 

EPO; JPO; 

IBM TDB 


2003/10/09 16:22 
0


survey adj question&5 


USPAT, 
US-PGPUB; 

EPO; JPO; 

IBM TDB 


2003/10/10 09: 34 


- 


365 


survey adj question 


USPAT; 

US-PGPUB; 
EPO- JPO- 
IBM_TDB


2003/10/10 09: 34 


- 


442 


survey adj question$5 


USPAT; 

US-PGPUB; 
EPO JPO- 
IBM_TDB 


2003/10/10 09: 34 


- 


3 


(survey adj question$5) with effectiveness


USPAT; 

US-PGPUB- 

EPO; JPO; 

IBM TDB 


2003/10/10 09:46 


- 


18 


("4483681" | "4755045" | "5227874" | "5243517" | "5870559" | "5878384" | "5911581" |
"5945991" | "5951642" | "5968125" | "6006197" | "6008807" | "6070145" | "6144991" |
"6228038" | "6260064" | "6275854" | "6286005").PN.


USPAT 


2003/10/10 09:40 




12 


(survey adj question$5) same effectiveness


USPAT- 

EPO- JPO- 
IBM TDB 


2003/10/10 10:10


- 


13 


("4451700" | "4937439" | "4992939" | "5218535" | "5365425" | "5551880" | "5684964" |
"5717865" | "5726914" | "5737494" | "5740035" | "5893098" | "6007340").PN.


USPAT 


2003/10/10 09: 57 




4 


("4191472" | "4464122" | "5083270" | "5857174").PN.


USPAT 


2003/10/10 10:06




5 


6038554.URPN. 


USPAT 


2003/10/10 10:08


- 


1086 


705/10.ccls.


USPAT; 

US-PGPUB; 
EPO; JPO;

IBM_TDB 


2003/10/10 12:11


- 


312 


705/10.ccls. and survey


USPAT; 

US-PGPUB; 
EPO- JPO- 
IBM TDB 


2003/10/10 12:11


- 


78 


(705/10.ccls. and survey) and score


USPAT; 

US-PGPUB; 
FPO- TPO- 
IBM TDB 


2003/10/10 12:11


- 


48 


((705/10.ccls. and survey) and score) and (@rlad<=20010131 or @ad<=20010131)


USPAT; 


2003/10/10 13:22






US-PGPUB; 
EPO- JPO- 
IBM TDB 






657 


(survey adj system) and (@rlad<=20010131 or @ad<=20010131)


USPAT; 

US-PGPUB, 
EPO- JPO- 
IBM TDB 


2003/10/10 13:22 




25 


((survey adj system) and (@rlad<=20010131 or @ad<=20010131)) and score


USPAT; 

US-PGPUB; 
EPO; JPO;
IBM_TDB


2003/10/29 16:14




382 


response adj distribution 


USPAT; 

US-PGPUB; 


2003/10/14 08:24 








EPO; JPO;

IBM_TDB














288 


(response adj distribution) and (@rlad<=20010131 or @ad<=20010131)


USPAT;








US-PGPUB; 

EPO; JPO; 

IBM_TDB 






9 


("4847784" | "5574828" | "5692906" | "5727950" | "5749736" | "5797753" | "5870768" |
"5875431" | "6064856").PN.


USPAT 


7nfl3/in/id OR- 


- 


6 


"5855011" 


USPAT; 

US-PGPUB, 
FPO- TPO- 
IBM TDB 


2003/10/14 09: 31 


- 


25 


(number adj response) with divid$3 with total 


USPAT;
US-PGPUB;

EPO; JPO;

IBM TDB 


2003/10/14 11:49


- 


0 


babbitt adj score 


USPAT; 

US-PGPUB, 

EPO; JPO; 

IBM TDB 


2003/10/14 11:49




0 


(observation respondent) adj membership 


USPAT; 

US-PGPUB; 

EPO; JPO; 

IBM_TDB 


2003/10/15 12:42 
23


(observation respondent) adj cluster$3


USPAT; 


2003/10/15 12- 51 






US-PGPUB; 










EPO; JPO; 










IBM TDB 






6 


6374251.URPN. 


USPAT 


2003/10/15 12:45 




3 


("5706503" | "5832182" | "5884305").PN.


USPAT 


2003/10/15 12:48 




60 


(dummy adj variable) and cluster$3


USPAT; 


2003/10/15 12: 51 






US-PGPUB* 










EPO- JPO- 










IBM TDB 






39 


((dummy adj variable) and cluster$3) and probability


USPAT; 


2003/10/15 13- 10 






US-PGPUB- 










EPO- JPO- 










IBM TDB 






13 


(membership adj status) and cluster$3


USPAT; 


2003/10/15 13* 11 








US-PGPUB- 










EPO- JPO- 










IBM TDB 






1099 


cluster adj analysis 


USPAT; 


2003/10/15 15- 36 






US-PGPUB- 










EPO- JPO- 










IBM TDB 






7 


(proxy adj value) and observation 


USPAT; 


2003/10/29 15:54






US-PGPUB- 










EPO- JPO- 










IBM TDB 






0 


purposely adj probability


USPAT; 


2003/10/29 15:55








US-PGPUB- 










EPO- JPO- 










IBM_TDB 






0 


serendipity adj probability 


USPAT* 


2003/10/29 15- 55 








US-PGPUB- 











EPO- JPO- 










IBM TDB 







12 


bayes adj probability


USPAT- 


2003/10/29 15:56








US-PGPUB- 










FPO- TPO. 










IBM TDB 






1253 


cluster$3 adj analysis


USPAT- 


2003/10/29 16:09






US-PGPUB- 











FPO- TPO- 










IBM TDB 







457 


(cluster$3 adj analysis) and observation


USPAT- 


2003/10/29 16:09






US-PGPUB- 










FPO- TPO- 










IBM TDB 






221 


((cluster$3 adj analysis) and observation) and probability


USPAT- 


2003/10/29 16:09






US-PGPUB- 











EPO; JPO; 










IBM* TDB 








(((cluster$3 adj analysis) and observation) and probability) and proxy


USPAT; 


2003/10/29 16:09






US-PGPUB- 











FPO- JPO- 










IBM TDB 







165 


(((cluster$3 adj analysis) and observation) and probability) and (@rlad<=20010131 or @ad<=20010131)


USPAT;


2003/10/29 16:14


US-PGPUB;








EPO- JPO 










IBM_TDB 






11 


((((cluster$3 adj analysis) and observation) and probability) and (@rlad<=20010131 or @ad<=20010131)) and bayes


USPAT;


2003/10/30 09:21


US-PGPUB;








EPO- JPO- 











IBM TDB 






13 


(conjoint adj analysis) and (@rlad<=20010131 or @ad<=20010131)


USPAT; 


2003/10/30 09:22






US-PGPUB- 











EPO- JPO- 











IBM_TDB 








serendipity with probability


USPAT- 


favUJ/lwJU lfa:H'3 






US-PGPUB- 











EPO; JPO; 










IBM TDB 






1 


ideal adj response adj distribution 


USPAT; 


2003/10/30 12:47 






US-PGPUB, 










EPO; JPO; 










IBM TDB 
0


percent adj (proxy adj value) 


USPAT; 

US-PGPUB; 
IBM TDB 


2003/10/30 12:47 


- 


0 


percent$4 adj (proxy adj value) 


USPAT; 

US-PGPUB; 
EPO- JPO- 
IBM TDB 


2003/10/30 12:47


- 


20 


percent$4 adj observation 


USPAT; 

US-PGPUB; 
EPO- JPO- 
IBM TDB 


2003/10/30 12:49


- 


128 


705/10.ccls. and cluster$3


USPAT; 

US-PGPUB; 
FPO. TPO. 
IBM TDB 


2003/10/30 12:50


- 


79 


(705/10.ccls. and cluster$3) and (@rlad<=20010131 or @ad<=20010131)


USPAT; 

US-PGPUB, 
FPO. TPO. 

IBM TDB 


2003/10/30 14: 32 


- 


32 


((705/10.ccls. and cluster$3) and (@rlad<=20010131 or @ad<=20010131)) and
probability 


USPAT; 

US-PGPUB- 
EPO- JPO- 
IBM TDB 


2003/10/30 12:50


- 


24 


(((705/10.ccls. and cluster$3) and (@rlad<=20010131 or @ad<=20010131)) and
probability) and percent$4 


USPAT; 

US-PGPUB; 
FPO. TPO. 


2003/10/30 14:29 






invalid adj survey adj response 


IBM TDB 




- 


0 


USPAT; 

US-PGPUB; 
FPO. TPO. 

IBM TDB 


2003/10/30 14:29 


- 


2 


valid adj survey adj response 


USPAT; 

US-PGPUB; 
FPO- TPO. 
IBM TDB 


2003/10/30 14: 32 


- 


119 


survey adj analysis 


USPAT; 

US-PGPUB; 
FPO- TPO. 
IBM TDB 


2003/10/30 14: 32 


- 


14 


(survey adj analysis) and Cluster$3 


USPAT; 

US-PGPUB; 

FPO. TPO. 

IBM TDB 


2003/10/30 14: 32 


- 


6 


((survey adj analysis) and Cluster$3) and (@rlad<=20010131 or @ad<=20010131)


USPAT; 

US-PGPUB; 

EPO; JPO;

IBM TDB 


2003/10/30 14:43 


- 


926 


(cluster$3 adj analysis) and (@rlad<=20010131 or @ad<=20010131)


USPAT; 

US-PGPUB; 
FPO. TPO. 

IBM TDB 


2003/10/30 14:44 


- 


340 


((cluster$3 adj analysis) and (@rlad<=20010131 or @ad<=20010131)) and
probability 


USPAT; 

US-PGPUB; 
FPO. TPO. 
IBM TDB 


2003/10/30 14:44 


- 


19 


(((cluster$3 adj analysis) and (@rlad<=20010131 or @ad<=20010131)) and
probability) and bayes 


USPAT; 
FPO- TPO. 

IBM TDB 


2003/10/30 15:53 


- 


4 


customer adj prospecting 


USPAT; 

US-PGPUB; 

FPO. TPO. 


2003/10/30 15:58 






prospective adj customer 


IBM TDB 




- 
1 Concepts and effectiveness of the cover-coefficient-based clustering methodology for
text databases 
Fazli Can, Esen A. Ozkarahan 

December 1990 ACM Transactions on Database Systems (TODS), Volume 15 Issue 4

Additional Information: full citation , abstract , references , citings , index 
terms , review 



Full text available: ^ pdf(274MB) 



A new algorithm for document clustering is introduced. The base concept of the algorithm, 
the cover coefficient (CC) concept, provides a means of estimating the number of clusters 
within a document database and related indexing and clustering analytically. The CC concept 
is used also to identify the cluster seeds and to form clusters with these seeds. It is shown 
that the complexity of the clustering process is very low. The retrieval experiments show 
that the information-retrieval effectiv ... 

Keywords: cluster validity, clustering-indexing relationships, cover coefficient, decoupling 
coefficient, document retrieval, retrieval effectiveness 



2 Intrusion detection and response: An empirical analysis of NATE: Network Analysis of
Anomalous Traffic Events
Carol Taylor, Jim Alves-Foss 

September 2002 Proceedings of the 2002 workshop on New security paradigms 

Full text available: ^ pdf(899.25 KB ) Additional Information: full citation , abstract , references 

This paper presents results of an empirical analysis of NATE (Network Analysis of Anomalous 
Traffic Events), a lightweight, anomaly based intrusion detection tool. Previous work was 
based on the simulated Lincoln Labs data set. Here, we show that NATE can operate under 
the constraints of real data inconsistencies. In addition, new TCP sampling and distance 
methods are presented. Differences between real and simulated data are discussed in the 
course of the analysis. 

Keywords: intrusion detection, statistics, traffic analysis 



3 Modelling the Internet: On characterizing affinity and its impact on network
performance 

Gabriel Lucas, Abhishek Ghose, John Chuang 
August 2003 Proceedings of the ACM SIGCOMM workshop on Models, methods and
tools for reproducible network research 

Full text available: Q pdf(236.10 KB) Additional Information: full citation , abstract , references 

An important component of simulation-based network research is the selection of nodes to a
member group, such as receivers in a multicast group or web clients in a content delivery
network. In a seminal paper, Philips et al. introduce an algorithm for generating member
groups with different degrees of affinity (clusteredness) and show that affinity can have a 
significant effect on multicast efficiency. Subsequent studies applying this algorithm have all 
used the algorithm's input paramete ... 



4 DATAPLOT - an interactive high-level language for graphics, non-linear fitting, data
analysis, and mathematics 
James J. Filliben 

August 1981 Proceedings of the 8th annual conference on Computer graphics and 
interactive techniques 

Full text available: ^pdf( 1.16 MB ) Additional Information: full citation , abstract , references , index terms 

This paper describes the design philosophy and features of DATAPLOT, a high-level (free-
format English-like syntax) language for: 1) graphics (continuous or discrete); 2) fitting
(linear or non-linear); 3) general data analysis; 4) mathematics. DATAPLOT was developed 
originally in 1977 in response to data analysis problems encount ... 

Keywords: Computer graphics, Data analysis, Diagrams, Fitting, Graphics languages, High-
level languages, Interactive computing, Mathematical modeling, Mathematics, Modeling,
Portability, Schematics, Software, Statistics



5 Statistics and data mining: intersecting disciplines
David J. Hand 

June 1999 ACM SIGKDD Explorations Newsletter, Volume 1 Issue 1

Full text available: ^ pdf(487.82 KB) Additional Information: full citation , abstract , references 

Statistics and data mining have much in common, but they also have differences. The 
nature of the two disciplines is examined, with emphasis on their similarities and 
differences. 

Keywords: knowledge discovery, statistics 



6 Partition testing, stratified sampling, and cluster analysis
Andy Podgurski, Charles Yang 

December 1993 ACM SIGSOFT Software Engineering Notes, Proceedings of the 1st ACM
SIGSOFT symposium on Foundations of software engineering, Volume 18
Issue 5

Full text available: pdf(1.35 MB) Additional Information: full citation, references, citings, index terms
7 Contributed articles: Genetic subtyping using cluster analysis

Tom Burr, James R. Gattiker, Greggory S. LaBerge

July 2001 ACM SIGKDD Explorations Newsletter, Volume 3 Issue 1

Full text available: pdf(984.40 KB) Additional Information: full citation, abstract, references

In this paper we (1) describe state-of-the-art methods to identify clusters in DNA sequence 
data for taxonomic analysis; (2) describe a new method with better scaling properties based 
on model-based clustering, and (3) present examples using the nucleoprotein and 
hemagglutinin regions of influenza and the env and gag regions of human immunodeficiency
virus (HIV). 

Keywords: DNA sequence analysis, HIV, influenza, model-based clustering, phylogenetic 
trees 



8 Data clustering: a review

A. K. Jain, M. N. Murty, P. J. Flynn 

September 1999 ACM Computing Surveys (CSUR), Volume 31 Issue 3 

Full text available: pdf(636.24 KB) Additional Information: full citation, abstract, references, citings, index
terms, review

Clustering is the unsupervised classification of patterns (observations, data items, or feature 
vectors) into groups (clusters). The clustering problem has been addressed in many 
contexts and by researchers in many disciplines; this reflects its broad appeal and 
usefulness as one of the steps in exploratory data analysis. However, clustering is a difficult 
problem combinatorially, and differences in assumptions and contexts in different
communities has made the transfer of useful generic co ... 

Keywords: cluster analysis, clustering applications, exploratory data analysis, incremental 
clustering, similarity indices, unsupervised learning 



9 Multiple molecular dynamics simulations of a 28mer oligopeptide reveal enhanced
sampling of conformational space
Sangeeta Sawant, A. S. Kolaskar 

January 2003 Proceedings of the First Asia-Pacific bioinformatics conference on 
Bioinformatics 2003 - Volume 19 

Full text available: pdf(1.53 MB) Additional Information: full citation, abstract, references, index terms

A strategy using multiple runs of MD simulations has been used for conformational searches 
of 28-mer peptide PSV which demonstrate the effectiveness of the strategy to sample the 
conformational space more efficiently than the single, long MD simulation. Further, the use 
of cluster analysis and Cα distances has been made to detect native like folds of the peptide
that are generated in the relatively short time scale simulations. 

Keywords: cluster analysis, conformational searches, multiple MD simulations, oligopeptide 



10 A general probabilistic framework for clustering individuals and objects
Igor V. Cadez, Scott Gaffney, Padhraic Smyth 

August 2000 Proceedings of the sixth ACM SIGKDD international conference on 
Knowledge discovery and data mining 

Full text available: pdf(214.48 KB) Additional Information: full citation, references, citings, index terms



Keywords: EM algorithm, clustering, mixture models 



11 An optimal repartitioning decision policy
David M. Nicol, Paul F. Reynolds 

December 1985 Proceedings of the 17th conference on Winter simulation 

Full text available: pdf(632.09 KB) Additional Information: full citation, abstract, references, citings



http://portal.acm.org/results.cfm?coll=ACM&dl=ACM&CFID=13636245&CFTOKE



The automated partitioning of simulations for parallel execution is a timely research 
problem. A simulation's run-time performance depends heavily on the nature of the inputs 
the simulation responds to. Consequently, a simulation's run-time behavior varies as a 
function of time. Since a simulation's run-time behavior is generally too complex to 
analytically predict, partitioning algorithms must be statistically based: they base their 
partitioning decisions on the simulation ... 

12 Rhythmic-motion synthesis based on motion-beat analysis
Tae-hoon Kim, Sang Il Park, Sung Yong Shin

July 2003 ACM Transactions on Graphics (TOG), volume 22 issue 3 

Full text available: ^ pdf(6.29 MB) Additional Information: full citation , abstract , references , index terms 

Real-time animation of human-like characters is an active research area in computer 
graphics. The conventional approaches have, however, hardly dealt with the rhythmic 
patterns of motions, which are essential in handling rhythmic motions such as dancing and 
locomotive motions. In this paper, we present a novel scheme for synthesizing a new motion 
from unlabelled example motions while preserving their rhythmic pattern. Our scheme first 
captures the motion beats from the example motions to extract ... 

Keywords: beat analysis, motion blending, motion signal processing, motion 
synchronization, motion synthesis, motion transition 



13 A reliability model combining representative and directed testing
Brian Mitchell, Steven J. Zeil 

May 1996 Proceedings of the 18th international conference on Software engineering 

Full text available: pdf(929.88 KB), Publisher Site Additional Information: full citation, abstract, references, citings, index terms

Directed testing methods, such as functional or structural testing, have been criticized for a 
lack of quantifiable results. Representative testing permits reliability modeling, which 
provides the desired quantification. Over time, however, representative testing becomes
inherently less effective as a means of improving the actual quality of the software under 
test. A model is presented which permits representative and directed testing to be used in 
conjunction. Representative testing can be use ... 

Keywords: directed testing, failure rates, fault detection, fault revelation, functional 
testing, interfailure time, order statistics, post-mortem debugged fault analysis, program 
debugging, program diagnostics, program testing, quantification, random processes, random 
variable, reliability estimate updating, reliability model, representative testing, software 
quality, software reliability, statistics, structural testing 



14 Variational learning of clusters of undercomplete nonsymmetric independent
components 

Kwokleung Chan, Te-Won Lee, Terrence J. Sejnowski 

March 2003 The Journal of Machine Learning Research, volume 3 

Full text available: pdf(345.39 KB) Additional Information: full citation, abstract, references, citings, index terms

We apply a variational method to automatically determine the number of mixtures of
independent components in high-dimensional datasets, in which the sources may be 
nonsymmetrically distributed. The data are modeled by clusters where each cluster is 
described as a linear mixture of independent factors. The variational Bayesian method yields 
an accurate density model for the observed data without overfitting problems. This allows 
the dimensionality of the data to be identified for each cluster. Th ... 
Keywords: Bayesian learning, ICA, density estimations, mixture models



15 Pursuing failure: the distribution of program failures in a profile space
William Dickinson, David Leon, Andy Podgurski 

September 2001 ACM SIGSOFT Software Engineering Notes , Proceedings of the 8th 
European software engineering conference held jointly with 9th ACM 
SIGSOFT international symposium on Foundations of software 
engineering, volume 26 issue 5 

Full text available: pdf(304.58 KB) Additional Information: full citation, abstract, references, citings, index
terms

Observation-based testing calls for analyzing profiles of executions induced by potential test
cases, in order to select a subset of executions to be checked for conformance to
requirements. A family of techniques for selecting such a subset is evaluated experimentally.
These techniques employ automatic cluster analysis to partition executions, and they use 
various sampling techniques to select executions from clusters. The experimental results 
support the hypothesis that with appropriate profil ... 

Keywords: adaptive sampling, cluster analysis, cluster filtering, failure-pursuit sampling, 
multivariate data analysis, observation-based testing, software testing 



16 Tests for gene clustering
Dannie Durand, David Sankoff 

April 2002 Proceedings of the sixth annual international conference on Computational 
biology 

Full text available:^ pdf(2.14 MB ) Additional Information: full citation , abstract , references , index terms 

Comparing chromosomal gene order in two or more related species is an important 
approach to studying the forces that guide genome organization and evolution. Linked 
clusters of similar genes found in related genomes are often used to support arguments of 
evolutionary relatedness or functional selection. However, as the gene order and the gene 
complement of sister genomes diverge progressively due to large scale rearrangements, 
horizontal gene transfer, gene duplication and gene loss, it becomes ... 

17 An approach to detecting changes in the factors affecting the performance of computer
systems

Robert Berry, Joseph Hellerstein 

April 1991 ACM SIGMETRICS Performance Evaluation Review, Proceedings of the 1991
ACM SIGMETRICS conference on Measurement and modeling of computer 
systems, volume 19 Issue 1 

Full text available: Q pdf(1.19 MB ) Additional Information: full citation , references , index terms 



18 Inferring domain-domain interactions from protein-protein interactions
Minghua Deng, Shipra Mehta, Fengzhu Sun, Ting Chen 

April 2002 Proceedings of the sixth annual international conference on Computational
biology 

Full text available: ^ pdf(1.95 MB) Additional Information: full citation , abstract, references , index terms 

Protein-protein interactions are important events in cellular and biochemical processes 
within a cell. Several researchers have undertaken the task of analyzing protein-protein 
interactions covering all genes of an organism by using yeast two-hybrid assays. Protein- 
protein interactions involve physical interactions between protein domains. Therefore, 
understanding protein interactions at the domain level gives a global view of the protein
interaction network, and possibly extends functions of pr ... 

19 Self-adaptive, on-line reclustering of complex object data
William J. McIver, Roger King

May 1994 ACM SIGMOD Record , Proceedings of the 1994 ACM SIGMOD international 

conference on Management of data, volume 23 issue 2 
Full text available: pdf(1.19 MB) Additional Information: full citation, abstract, references, citings, index

A likely trend in the development of future CAD, CASE and office information systems will be
the use of object-oriented database systems to manage their internal data stores. The 
entities that these applications will retrieve, such as electronic parts and their connections or 
customer service records, are typically large complex objects composed of many 
interconnected heterogeneous objects, not thousands of tuples. These applications may 
exhibit widely shifting usage patterns due to their i ... 

20 A bayesian approach to transcript estimation from gene array data: the BEAM
technique

Ron O. Dror, Jonathan G. Murnick, Nicola A. Rinaldi, Voichita D. Marinescu, Ryan M. Rifkin,
Richard A. Young 

April 2002 Proceedings of the sixth annual international conference on Computational 
biology 

Full text available: pdf(1.84 MB) Additional Information: full citation, abstract, references, index terms

We present a new statistically optimal approach to estimate transcript levels and ratios from 
one or more gene array experiments. The Bayesian Estimation of Array Measurements 
(BEAM) technique uses a model of measurement noise and prior information to estimate 
biological expression levels. It provides a principled method to deal with negative expression 
level measurements, combine multiple measurements, and identify changes in expression 
level. BEAM is more flexible than existing techniques, bec ...

Keywords: DNA microarrays, affymetrix chips, bayesian estimation, statistical confidence 
A survey of Web metrics

Devanshu Dhyani, Wee Keong Ng, Sourav S. Bhowmick 

December 2002 ACM Computing Surveys (CSUR), volume 34 issue 4 

Full text available: pdf(289.28 KB) Additional Information: full citation, abstract, references, index terms

The unabated growth and increasing significance of the World Wide Web has resulted in a flurry of 
research activity to improve its capacity for serving information more effectively. But at the heart of
these efforts lie implicit assumptions about "quality" and "usefulness" of Web resources and services.
This observation points towards measurements and models that quantify various attributes of web
sites. The science of measuring all aspects of information, especially its storage and retrieval or ...

Keywords: Information theoretic, PageRank, Web graph, Web metrics, Web page similarity, quality 
metrics 



2 Fast detection of communication patterns in distributed executions 
Thomas Kunz, Michiel F. H. Seuren

November 1997 Proceedings of the 1997 conference of the Centre for Advanced Studies on 
Collaborative research 

Full text available: pdf(4.21 MB) Additional Information: full citation, abstract, references, index terms

Understanding distributed applications is a tedious and difficult task. Visualizations based on process-
time diagrams are often used to obtain a better understanding of the execution of the application. The 
visualization tool we use is Poet, an event tracer developed at the University of Waterloo. However, 
these diagrams are often very complex and do not provide the user with the desired overview of the 
application. In our experience, such tools display repeated occurrences of non-trivial commun ... 

3 Using name-based mappings to increase hit rates
David G. Thaler, Chinya V. Ravishankar

February 1998 IEEE/ACM Transactions on Networking (TON), volume 6 issue 1

Full text available: pdf(408.98 KB) Additional Information: full citation, references, citings, index terms

Keywords: World Wide Web, caching, client-server systems, computer networks, distributed
agreement, multicast routing, proxies 



4 Analyzing stability in wide-area network performance 

Hari Balakrishnan, Mark Stemm, Srinivasan Seshan, Randy H. Katz

June 1997 ACM SIGMETRICS Performance Evaluation Review, Proceedings of the 1997 ACM
SIGMETRICS international conference on Measurement and modeling of computer 
systems, volume 25 issue 1 

Full text available: pdf(1.76 MB) Additional Information: full citation, abstract, references, citings, index terms

The Internet is a very large scale, complex, dynamical system that is hard to model and analyze. In this
paper, we develop and analyze statistical models for the observed end-to-end network performance
based on extensive packet-level traces (consisting of approximately 1.5 billion packets) collected from
the primary Web site for the Atlanta Summer Olympic Games in 1996. We find that observed mean 
throughputs for these transfers measured over 60 million complete connections vary widely as a 
funct ... 

5 On the scale and performance of cooperative Web proxy caching

Alec Wolman, M. Voelker, Nitin Sharma, Neal Cardwell, Anna Karlin, Henry M. Levy 

December 1999 ACM SIGOPS Operating Systems Review, Proceedings of the seventeenth ACM
symposium on Operating systems principles, volume 33 issue 5

Full text available: pdf(1.87 MB) Additional Information: full citation, abstract, references, citings, index terms

While algorithms for cooperative proxy caching have been widely studied, little is understood about
cooperative-caching performance in the large-scale World Wide Web environment. This paper uses both
trace-based analysis and analytic modelling to show the potential advantages and drawbacks of inter-
proxy cooperation. With our traces, we evaluate quantitatively the performance-improvement potential
of cooperation between 200 small-organization proxies within a university environment, and



6 Special issue on Machine learning methods for text and images: Matching words and pictures
Kobus Barnard, Pinar Duygulu, David Forsyth, Nando de Freitas, David M. Blei, Michael I. Jordan 
March 2003 The Journal of Machine Learning Research, volume 3 



We present a new approach for modeling multi-modal data sets, focusing on the specific case of
segmented images with associated text. Learning the joint distribution of image regions and words has
many applications. We consider in detail predicting words associated with whole images (auto-
annotation) and corresponding to particular image regions (region naming). Auto-annotation might help
organize and access large collections of images. Region naming is a model of object recognition as a
process ...

7 Collaborative Web caching based on proxy affinities
Jiong Yang, Wei Wang, Richard Muntz 

June 2000 ACM SIGMETRICS Performance Evaluation Review, Proceedings of the 2000 ACM
SIGMETRICS international conference on Measurement and modeling of computer 
systems, volume 28 issue 1 

Full text available: pdf(1.10 MB) Additional Information: full citation, abstract, references, index terms

With the exponential growth of hosts and traffic workloads on the Internet, collaborative web caching
has been recognized as an efficient solution to alleviate web page server bottlenecks and reduce traffic.
However, cache discovery, i.e., locating where a page is cached, is a challenging problem, especially in
the fast growing World Wide Web environment, where the number of participating proxies can be very
large. In this paper, we propose a new scheme which employs proxy affinities to mai ...
Full text available: pdf(789.04 KB)

8 The state of the art in locally distributed Web-server systems
Valeria Cardellini, Emiliano Casalicchio, Michele Colajanni, Philip S. Yu 
June 2002 ACM Computing Surveys (CSUR), volume 34 issue 2 

Full text available: pdf(1.41 MB) Additional Information: full citation, abstract, references, citings, index terms

The overall increase in traffic on the World Wide Web is augmenting user-perceived response times
from popular Web sites, especially in conjunction with special events. System platforms that do not
replicate information content cannot provide the needed scalability to handle large traffic volumes and
to match rapid and dramatic changes in the number of clients. The need to improve the performance of
Web-based services has produced a variety of novel content delivery architectures. This article w ...

Keywords: Client/server, World Wide Web, cluster-based architectures, dispatching algorithms, 
distributed systems, load balancing, routing mechanisms 



9 Active Proxy-G: optimizing the query execution process in the grid
Henrique Andrade, Tahsin Kurc, Alan Sussman, Joel Saltz 

November 2002 Proceedings of the 2002 ACM/IEEE conference on Supercomputing 

Full text available: pdf(247.81 KB) Additional Information: full citation, abstract, references

The Grid environment facilitates collaborative work and allows many users to query and process data
over geographically dispersed data repositories. Over the past several years, there has been a growing
interest in developing applications that interactively analyze datasets, potentially in a collaborative
setting. We describe the Active Proxy-G service that is able to cache query results, use those results for
answering new incoming queries, generate subqueries for the parts of a query that cann ...

10 An investigation of geographic mapping techniques for internet hosts
Venkata N. Padmanabhan, Lakshminarayanan Subramanian 

August 2001 Proceedings of the 2001 conference on Applications, technologies, architectures, 
and protocols for computer communications 

Full text available: pdf(319.78 KB) Additional Information: full citation, references, citings



11 Consistent hashing and random trees: distributed caching protocols for relieving hot spots on the
World Wide Web 

David Karger, Eric Lehman, Tom Leighton, Rina Panigrahy, Matthew Levine, Daniel Lewin 

May 1997 Proceedings of the twenty-ninth annual ACM symposium on Theory of computing 

Full text available: pdf(1.73 MB) Additional Information: full citation, references, citings, index terms



12 Clustering algorithms for wireless ad hoc networks

Lakshmi Ramachandran, Manika Kapoor, Abhinanda Sarkar, Alok Aggarwal 

August 2000 Proceedings of the 4th international workshop on Discrete algorithms and methods
for mobile computing and communications 

Full text available: pdf(1.04 MB) Additional Information: full citation, abstract, references, citings, index terms

Efficient clustering algorithms play a very important role in the fast connection establishment of ad hoc
networks. In this paper, we describe a communication model that is derived directly from that of 
Bluetooth, an emerging technology for pervasive computing; this technology is expected to play a 
major role in future personal area network applications. We further propose two new distributed 
algorithms for clustering in wireless ad hoc networks. The existing algorithms often become infeasi ... 

The content and access dynamics of a busy Web site: findings and implications 
Venkata N. Padmanabhan, Lili Qiu 
August 2000 ACM SIGCOMM Computer Communication Review, Proceedings of the conference on
Applications, Technologies, Architectures, and Protocols for Computer
Communication, Volume 30 issue 4

Full text available: pdf(820.58 KB) Additional Information: full citation, abstract, references, citings, index terms

In this paper, we study the dynamics of the MSNBC news site, one of the busiest Web sites in the
Internet today. Unlike many other efforts that have analyzed client accesses as seen by proxies, we
focus on the server end. We analyze the dynamics of both the server content and client accesses made
to the server. The former considers the content creation and modification process while the latter 
considers page popularity and locality in client accesses. Some of our key results are: (a) files ... 



Session 10D: management of computation: Self-organized autonomous web proxies
Markus J. Kaiser, Kwok Ching Tsui, Jiming Liu 

July 2002 Proceedings of the first international joint conference on Autonomous agents and 
multiagent systems: part 3 

Full text available: pdf(316.41 KB) Additional Information: full citation, abstract, references, index terms

With the increasing size of the Internet, proxy servers have emerged as a feasible way to reduce the 
overall network load and latency. More recently researchers focused on new ways to combine multiple 
cooperative proxies into one transparent proxy system to further increase the overall performance gain,
but no work so far was really able to propose an ideal trade-off between content dissemination and 
clustering in a changing environment caching environment. This paper introduces a self-organizin ... 

Keywords: data clustering, load balancing, proxy, self-organization 



A client-aware dispatching algorithm for web clusters providing multiple services
Emiliano Casalicchio, Michele Colajanni 

April 2001 Proceedings of the tenth international conference on World Wide Web 

Full text available: pdf(311.46 KB) Additional Information: full citation, references, citings, index terms



Keywords: clusters, dispatching algorithms, load balancing 



16 Process migration 

September 2000 ACM Computing Surveys (CSUR), Volume 32 Issue 3 

Full text available: pdf(1.24 MB) Additional Information: full citation, abstract, references, citings, index terms, review

Process migration is the act of transferring a process between two machines. It enables dynamic load 
distribution, fault resilience, eased system administration, and data access locality. Despite these goals
and ongoing research efforts, migration has not achieved widespread use. With the increasing 
deployment of distributed systems in general, and distributed operating systems in particular, process 
migration is again receiving more attention in both research and product development. As hi ... 
During the past years, several attempts have been made to develop functionality for mobility
management support and QoS provision in the realm of the IP networks. Since IP was not designed to
support such functionality, new protocols have been specified and implemented to tackle these issues.
Mobile IP is currently the dominant protocol that allows users to retain connectivity while roaming in IP
networks. RSVP (Resource reservation Protocol) is a well established protocol for reserving network ...

Keywords: QoS, RSVP, mobile IP, mobility management 
Full text available: pdf(2.39 MB) Additional Information: full citation, abstract, references

As part of the process of delivering content, devices like proxies and gateways log valuable information
about the activities and navigation patterns of users on the Web. In this study, we consider how this 
navigation data can be used to improve Web search. A query posted to a search engine together with 
the set of pages accessed during a search task is known as a search session. We develop a mixture 
model for the observed set of search sessions, and propose variants of the classical EM a ... 

Keywords: expectation-maximization algorithm, model-based clustering, proxy access logs, query 
clustering, web searching 
20 Implications of proxy caching for provisioning networks and servers

In this paper, we examine the potential benefits of web proxy caches in improving the effective capacity
of servers and networks. Since networks and servers are typically provisioned based on a high 
percentile of the load, we focus on the effects of proxy caching on the tail of the load distribution. We 
find that, unlike their substantial impact on the average load, proxies have a diminished impact on the 
tail of the load distribution. The exact reduction in the tail and the corresp ... 